Using Monte Carlo Search with Data Aggregation to Improve Robot Soccer Policies

نویسندگان

  • Francesco Riccio
  • Roberto Capobianco
  • Daniele Nardi
چکیده

RoboCup soccer competitions are considered among the most challenging multi-robot adversarial environments, due to their high dynamism and the partial observability of the environment. In this paper we introduce a method based on a combination of Monte Carlo search and data aggregation (MCSDA) to adapt discrete-action soccer policies for a defender robot to the strategy of the opponent team. By exploiting a simple representation of the domain, a supervised learning algorithm is trained over an initial collection of data consisting of several simulations of human expert policies. Monte Carlo policy rollouts are then generated and aggregated to previous data to improve the learned policy over multiple epochs and games. The proposed approach has been extensively tested both on a soccer-dedicated simulator and on real robots. Using this method, our learning robot soccer team achieves an improvement in ball interceptions, as well as a reduction in the number of opponents’ goals. Together with a better performance, an overall more efficient positioning of the whole team within the field is achieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stochastic Planning in Large Search Spaces

Multi-agent planning approaches are employed for many problems including task allocation, surveillance and video games. In the first part of my thesis, we study two multi-robot planning problems, i.e. patrolling and task allocation. For the patrolling problem, we present a novel stochastic search technique, Monte Carlo Tree Search with Useful Cycles, that can generate optimal cyclic patrol poli...

متن کامل

A Monte Carlo-Based Search Strategy for Dimensionality Reduction in Performance Tuning Parameters

Redundant and irrelevant features in high dimensional data increase the complexity in underlying mathematical models. It is necessary to conduct pre-processing steps that search for the most relevant features in order to reduce the dimensionality of the data. This study made use of a meta-heuristic search approach which uses lightweight random simulations to balance between the exploitation of ...

متن کامل

Robust Monte Carlo Control Policies to Maneuver Tensegrity Robots out of Obstacles

Multiagent learning has been shown to be e↵ective in creating control policies for sophisticated soft-robotic systems based on tensegrity structures (built from interconnected rods and cables). The distributed nature of the tension network within a tensegrity structure along with its smooth distribution of forces is a natural match for distributed learning. Indeed, multiagent learning has been ...

متن کامل

Robo-Erectus Senior III (RESr-III): A Teen Size Humanoid Soccer Robot

This paper provides a brief description of Robo-Erectus Senior III– a teen size soccer playing humanoid robot developed at Advanced Robotics and Intelligent Control Centre of Singapore Polytechnic. The mechanical and electrical specifications of the robot are described. This paper also covers the vision processing, locomotion control, state-driven Monte Carlo localization and force/torque senso...

متن کامل

Comparison of Localization Methods for a Robot Soccer Team

In this work, several localization algorithms that are designed and implemented for Cerberus'05 Robot Soccer Team are analyzed and compared. These algorithms are used for global localization of autonomous mobile agents in the robotic soccer domain, to overcome the uncertainty in the sensors, environment and the motion model. The algorithms are Reverse Monte Carlo Localization (R-MCL), Simple Lo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016